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Abstract 

The Schrodinger equation for non-relativistic quantum systems is derived from some classical 
physics axioms within an ensemble hamiltonian framework. Such an approach enables one to 
understand the structure of the equation, in particular its linearity, in intuitive terms. Furthermore 
it allows for a physically motivated and systematic investigation of potential generalisations which 
are briefly discussed. 
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I. MOTIVATION 



It is often stated that of the two departures from Newtonian physics at the beginning of 
the twentieth century, relativity theory has a pleasing physical foundation while quantum 
theory is grounded more in abstract mathematical structures. Textbooks either quote the 
Schrodinger equation with little motivation or obtain it as the description of state evolution 
in a particular picture: but the reason for choosing states in a linear vector space in the first 
place is left unexplained. 

Schrodinger's original derivation, involving analogies with wave optics and various limits, 
is now considered only of heuristic value as he was then still unaware of the interpretation 
of the wavefunction as a probability amplitude rather than a physical wave. However, with 
hindsight, we know that Schrodinger's equation may be re-written in more familiar terms 
through a change of variables; starting from 

iKij)= \ -^d 2 + V V, (1) 
| 2m 

one performs a Madelung transformation Q ip = y/p t %ry l h which decomposes the Schrodinger 
equation into two real equations, 

2m 8m \ p z p J 

p + -di (p d l( j) = , (3) 
m 

the summation convention being used unless otherwise stated and the overdot referring to a 
partial time derivative. The first equation is a generalisation of the classical Hamilton- Jacobi 
equation, the term with explicit h dependence, the "quantum potential" , summarising the 
peculiar aspects of quantum theory. If the quantum potential is ignored then the equations 
have a simple classical interpretation: It is assumed that one is uncertain about the initial 
conditions so that probabilistic methods must be used to describe the location of the particle. 
With p(x, t) denoting the normalised probability density, the second equation of motion 
above is the continuity equation with a determining the velocity, Vi, through Vi = (di<j)/m. 
What transforms the classical ensemble dynamics into quantum mechanics is the quantum 
potential, the point of focus in the deBroglie-Bohm picture Q]. 

However the structure of the quantum potential is unusual, making the quantum 
Hamilton- Jacobi equation (j2J) difficult to understand in purely classical terms. Though 



some studies, such as those of Nelson and others [4] , have been made to derive (J2J from 
some stochastic micro-dynamics, the assumptions either go beyond familiar classical physics 
or introduce additional ingredients that raise new puzzles. 

A somewhat different approach has been to start with the plausible classical ensemble 
Hamilton- Jacobi equation, argue that it is incomplete, and then try to constrain possible 
extensions by some consistency requirements. Two such recent derivations are in The 
assumption of Hall and Reginatto was that the classical equation only described the mean 
motion of the particles, and that the momentum of the particles have some fluctuations 
about the mean value. It was postulated that those fluctuations obey an exact uncertainty 
relation |7], and that the fluctuation term also obeyed some axioms such as locality and 
separability. In on the other hand, the maximum uncertainty (entropy) principle ^] 
was used, as suggested earlier in 0, to constrain the probability distribution p(x,t). 
The constraint was implemented through a lagrange multiplier and the unique uncertainty 
measure that accompanies the lagrange multiplier was constructed from physically motivated 
axioms. The end result in both approaches is that the classical Hamilton- Jacobi equation 
gets a contribution, the quantum potential, with Planck's constant making its appearance 
to balance the dimensions between the old terms and the new. 

Once the quantum Hamilton- Jacobi equation (J2J) has been obtained, it and the continuity 
equation © represent two coupled nonlinear differential equations for the variables p, a. 
They can be uncoupled and linearised through the Madelung transformation resulting in the 
usual Schrodinger equation and its complex conjugate. The meaning behind the Madelung 
transformation was explained in [a]: it is a change to a canonically conjugate set of variables 
that would uncouple the equations. 

Thus in both of the approaches 0,0], only the extension of the classical equation was con- 
structed from various axioms: this still gives the impression that something extra and special 
must be imposed on classical ensemble dynamics to arrive at quantum theory. However, as 
this paper aims to demonstrate, one can proceed much further. 

The primary purpose of this paper is to present a set of physical axioms that can be used 
to construct the Schrodinger's equation directly, without assuming the classical ensemble 
Hamilton- Jacobi equation as the starting point, nor assuming any specific underlying dy- 



namics. For this a Hamiltonian framework for ensembles, as discussed in 



but the axioms will be refined from those used in [y]. This approach will achieve three goals: 



ll| , will be used 
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(i) It will show that it is possible to understand the structure of Schrodinger's equation in 
standard classical physics terms, (ii) emphasize that one may arrive at a quantum theory 
without "quantising" , in one way or another, some classical dynamics (which logically should 
be the limit of the quantum theory) and (hi) provide an avenue for physically motivated 
extensions of quantum theory that might be relevant for current studies of space at short 
distances. 

The axioms are listed and explained in the next section followed by the construction of the 
— —an in Sec,(3). T„ m ain d lfa ence s between M .Q and this paper ar e 
discussed in the concluding section together with some comments on potential generalisations 
of linear quantum theory. Some technical issues concerning Galilean invariance and gauge- 
inequivalence are discussed in the appendices. 

II. THE AXIOMS 

We wish to describe the dynamics of N particles of which we do not have sufficient 
information about the initial conditions, so that statistical methods must be used to locate 
the particles. Note that even for a single particle, N — 1, one has an ensemble dynamics. 
The discussion is being carried out for the general multiparticle case so that the separability 
axiom can be discussed. 

Let p(x, t) denote the normalised probability density for the N particles, with x sum- 
marising all the spatial coordinates. The following discussion will use Cartesian coordinates 

in d + 1 dimensions, with configuration space indices i,j = 1,2, , dN. Here i = l,...d, 

refer to the coordinates of the first particle of mass mi, i = d + 1, 2c?, to those of the 

second particle of mass m 2 and so on. A diagonal and positive definite configuration space 
metric, = Sij/m^, with the symbol (i) defined as the smallest integer > i/d, is assumed 
as in classical dynamics j^. That metric not only encodes information about the inertia 
(mass) of the particles, which can be different, the indices allow contraction with derivatives 
and enable a useful summary of the spacetime symmetries that will be assumed below. 

Let H be the ensemble hamiltonian depending on the normalised probability density 
p(x, t) and let S(x, t) denote the canonically conjugate variable, 




(4) 
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with V some external potential influencing the particles' motion. Hamilton's equations are 

dp 6H 

at ~ ~ss J {b) 

dS 6H 



at 5 P ■ (6) 

Since p is the probability density, the first equation will turn out to be the continuity 
equation. 

The objective is to construct an explicit form for h(p, S) in (J1J starting from the following 
reasonable axioms: 

• [AO] Hamiltonian: The existence of an ensemble hamiltonian of the form (J3J, evo- 
lution determined by Hamilton's equations, and a configuration space metric with the 
stated properties, may be formalised in this opening axiom. 

• [Al] Locality: h should be a function of p, S, and their spatial derivatives. That is, 
for example, h should not involve any integrals as that would couple fields at distant 
points and create problems with causality. Note that disallowing time-derivatives in 
h ensures that hamilton's equations (J5I6|) have time-derivatives only on the left-hand- 
side. 

• [A2] Separability: h should be separable for the case of two independent sub-systems 
described by probability distributions p\ and p 2 ; h(p = P1P2) = h{.Pi) + h(p 2 ) so that 
H itself can be written as the sum of two independent terms. The factorisation of p 
for separable systems will affect how S behaves through Hamilton's equations; it is 
sufficient for consistency to require S — S± + S^. 

• [A3] Symmetry: The observed symmetries of Nature should be included in the de- 
scription of the system. Since we are working in the non-relativistic limit, the equations 
of motion should be form- invariant under the Galilean group \v\ . The translational 
part of this symmetry can be used to eliminate any explicit dependence of h on x^t 
while rotational invariance will be used as an explicit constraint in the construction be- 
low. However invariance under Galilean boosts can only be imposed on the equations 
of motion, rather than on h, as that involves transformations of the time-derivatives: 
remarkably though, the equations that are obtained below, after imposing the other 
conditions, are already invariant under boosts (see Appendix A). 



• [AA\ Universality: It is desirable to construct a hamiltonian that describes universal 
dynamics, that is, h should be independent of V and any other specific properties of the 
particles or their number except those encoded in the configuration-space metric. An 
example of how this universality constraint can be used is as follows: the normalisation 
of probability, 1 = J dx Nd p(x,t), implies that the dimension of p(x,t) depends on the 
dimension of the configuration space. Thus, as elaborated below, H can be universal 
only if h is scale invariant, h(Xp) = h{p). This scale invariance will ensure that the 
resulting equations of motion have a form independent of the number of particles. 
Hamilton's equations show that the scale-invariance condition does not affect S. 

It must be emphasized that universality as defined above implies much more than 
simply the statement that the terms in h be independent of the dimension of con- 
figuration space. For example, to compensate for the dimension of p one might just 
try using iV powers of x, but since translational invariance (for V = 0) disallows 
explicit dependence on x, one must resort to derivatives. So consider the candidate 
h = p~ 1 {d\d2 dftlogp): its dimension is independent of the dimension of configu- 
ration space and yet it is not scale invariant. However this example clearly does not 
lead to universal dynamics because the form of the equations of motion change as the 
number of particles changes, with higher-derivative terms appearing with increasing 
N . This shows that imposing universality fully, as defined, clearly disallows compen- 
sating the dimension of p with factors of x or d/dx. Which means that H can be 
universal only if h is scale invariant, as stated above. 

A beneficial technical outcome from the scale invariance of h is that one need not 
impose the normalisation condition on p in the hamiltonian but rather postpone it to 
a latter stage, for example after solving the equations of motion. 

• [A5] Positivity: The existence of a stable ground state requires that the hamiltonian 
H be bounded from below for potentials V that are likewise bounded. For this to 
be true for generic potentials, and thus lead to universal dynamics, it is sufficient to 
choose h to be positive definite. 

• [A6] Simplicity: The fewer arbirary parameters a scientific theory has, the easier it 
can be falsified experimentally and so lead to suggestions for improvement. Further- 
more, if a simple theory is able to explain all the available data then one is more likely 
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to accept it as something "fundamental" rather than an intermediate phenomenological 
description. Indeed, a useful working hypothesis in physics has been that fundamental 
laws should be universal and simple. 

Thus one would like to construct a hamiltonian that has a minimum number of arbi- 
trary constants. For example, the scale invariance of h deduced in [A4] means that 
h must contain derivatives of p if it is to depend on p. Rotational symmetry implies 
that at least two derivatives would be required. Therefore one constraint is to let h 
contain not more than two derivatives in any product of terms that appears in it. As 
each derivative involves an inverse length, this condition obviously restricts the num- 
ber of new dimensional parameters, beyond the metric, that can appear in the action. 
This specific implementation of the simplicity condition, whereby not more than two 
derivatives appear in any product of terms, will be referred to in brief as "absence of 
higher number of derivatives" or "AHD" . 

It is implicit in the foregoing discussion that one would like to construct and describe 
some realistic dynamics and so the simplicity axiom should be interpreted in that 
context. So, for example, the "simplest" suggestion h = is vacuous, and as we 
shall see later, the constant A in ()12|) must be chosen to be non-zero for nontrivial 
dynamics. Thus clearly one is searching for equations that have a minimum number 
of free parameters and yet describe interesting dynamical systems. This approach, of 
starting with the simplest nontrivial dynamics, allows for systematic extensions that 
are briefly discussed in the concluding section. 

III. CONSTRUCTION OF THE HAMILTONIAN 

Using the axioms one may construct an explicit form for h(p, S). Rotational invariance 
and AHD imply that the building blocks of h must be 

Wx, gijUxid^idjUs) and g^d^W^ (7) 

where the Ui,Wi,l = 1, 2, 3 are real functions of p, S only (and not also of their derivatives). 
Separability restricts h to be linear in g^: 

h = W 1 + 9ij (U^diUtoMUto) + W 2n d i d j W Zn ) (8) 
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where a possible additional index n is summed over. 

Consider Wi(p, S). Scale- invariance implies it cannot depend on p, so W\ = Wi(S). But 
now separability requires W% oc S. However in general the final result violates positivity of 
H and hence one concludes that W\ = 0. 

The terms involving W2 and W3 cannot generically lead to a positive definite density h 
and so by the universality axiom we must set those terms to zero. Similarly, positivity of h 
in the general case (universality) implies JJ^n —U^n and U\ n > so that h becomes a sum 
of positive terms. (One can actually avoid imposing the conditions in this paragraph and 
proceed as in p, but the discussion is more concise this way). 

Hence 

h = 9ijUi n (diU2n)(djU2n) ■ (9) 

Next, using the chain rule, 

dU dU 

^ = ^— + 9^—, (10) 

one can extract explicitly the derivatives of p, S in the above expression. Then using the 
separability axiom one deduces, as in jfjj], that various combination of terms must be just 
constants. 
Thus 



h = 9ij AidiSXdjS) + logp)(a,- logp) + K&QogP + a n S))(d 3 (\ogp + a n S)) 




with a n and A,B,b n non- negative. The result has been written in a form that emphasizes 
positivity. If one expanded out the terms in the sum over n, then the non-cross terms can 
be combined with the A and B forms, giving new non-negative coefficients A and B, and 
leaving cross-terms involving (dilogp)(djS) with a net coefficient C = 2^2a n b n . Although 
the equations of motion would only depend on A, B, C one would still need to impose the 
positivity constraint which makes C depend on A, B through the a n , b n . 

Thus the dynamics appears to depend on the unlimited number of free parameters a n , b n . 
The simplicity axiom encourages one to reduce this dependence, in the first instance, by 
choosing b n = 0, thereby truncating h to 

h = g tJ (A(d l S)(d J S) + B(d l logp)(d,logp)) . (12) 
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A consequence of this diagonal form for h is another simplification, for it leads to the 
usual continuity equation (JTljl where the particle velocities are independent of any explicit 
dependence on the probability density p. 

Since A must be non-zero for nontrivial dynamics, it can be absorbed in a redefinition of 
the metric. The final result for h therefore depends only on a single new universal parameter 
B with dimensions of action-squared. If B = then one has classical ensemble dynamics 
generalising the usual Hamilton- Jacobi description of classical mechanics. For nonzero B, 
the resulting hamilton's equations are, in the conventional normalisation A = 1/2, 

s + fa iS a jS + v + Bgii - = o , (is) 

P + S« OApdjS) = 0. (14) 



These real nonlinear equations can be combined and rewritten, via the inverse Madelung 
transformation, as the standard Schrodinger equation when B is identified with h 2 /8, 



ihip 



2 



—gijdidj + V 



V>- (15) 



Thus the same set of axioms have allowed us to obtain two theories: Classical ensemble 
dynamics for B = in (jl2j) and the non-relativistic (and linear) Schrodinger equation for 
B > 0. The linear quantum theory is thus seen to be a single parameter extension of the 

n 

classical theory [18]. 

IV. CONCLUSION 

A main result of this investigation is that one may derive and understand the structure 



of Schrodinger's equation using intuitive classical concepts and axioms. 



n particular its 
where the 



in 
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linearity is seen to be a consequence of the other assumptions. Unlike 
quantum action was obtained by extending a given classical action, here both the classical 
and quantum dynamics were constructed from a single set of axioms. 

Although the axioms used here appear similar to those used in ((J, there are a number 
of crucial differences that should be highlighted. While positivity was used in [f| primarily 
to give the (inverse) uncertainty measure a sensible interpretation, the axiom [A5] adopted 
here has been motivated by the need for a Hamiltonian bounded from below. In (f| scale- 
invariance of h was demanded as a sufficient condition for universality, while here scale- 
invariance has been argued to be a consequence of the broader requirement of universality. 



While the AHD condition was imposed in [6|, here the broader simplicity axiom has been 
used, of which AHD is a natural special case. Finally, the axioms in (f| were imposed on 
the (inverse) uncertainty measure that was added to the classical Lagrangian while here the 
axioms were used to construct the ensemble Hamiltonian. 

As is manifest in Sect. (3), the universality and simplicity axioms have been used in 
ways that go beyond the related but narrower homogeneity and AHD conditions that were 
used in Ref.p]. The broader conditions have been adopted so as to accomplish the wider 
scope of the construction: the full ensemble Hamiltonian in this paper versus a piece of 
the Lagrangian in Ref. Nonetheless, the more general axioms used here have a natural 
physical interpretation, as discussed above, perhaps even more so than those used in Ref.jf]]. 

Further insight into the results of Sect. (3) can be obtained by enquiring about the type 
of equations that would result if one abandoned one or more of the axioms. For example, 
allowing higher number of derivatives of p enables terms like Q| 

h(p) = QijdiQogp + r]f(p))d j (\ogp + r)f(p)) (16) 

with f(p) = gki{dk logp)(<9; logp) and r\ a constant. This h\ satisfies all the constraints except 
AHD. Dimensional analysis shows that one must introduce a new length scale associated with 
such nonlinearities. Requiring universality implies that the nonlinear terms are associated 
with a universal length scale, a natural candidate being the Planck length. Thus in this way 
one sees a possible link between gravity and nonlinear corrections to Schrodinger's equation 



12. 



131 ] . Such generalisations, and their interpretation in terms of short- distance physics, 
are discussed at greater length in Ref.Q] which also lists related literature. 

A challenge is to extend the axiomatic construction to include fermions. This might 
involve the use of additional mathematical structures like Grassmann variables to summarise 
internal degrees of freedom, and a further refinement of the axioms. 



V. APPENDIX A: GALILEAN INVARIANCE 



The equations of motion that follow from (jllj) . for V = 0, are 

S + Ag^SdjS + Bg i3 - - Candida = , (17) 

p + 2A 9ij di (p d,S) + Ci,, ;,<),<),,, = , (18) 
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with A, B, C as defined earlier. One may wonder if demanding invariance of these equations 
under Galilean boosts, which have not yet been imposed, yields any constraints on the 
undetermined constants. 

As the essential features are manifest already for a single particle in one space dimension, 
consider that uncluttered case first. The relevant equations are 

m m \ p 2 p J m 

2 A C 
p + — d (p dS) + —d 2 p = , (20) 
m m 

and one would like these equations to be form invariant under the transformation tf = t,x' = 
x — ut, with u the boost velocity. Since p(x,t)dx is the probability of finding the particle 
in the region around x, it is required to be invariant under coordinate transformations. But 
dx = dx' and so one deduces that p'(x',t') = p(x,t) where p' is the probability density in 
the primed frame. 

The transformation of the coordinates induces an obvious tranformation of the deriva- 
tives: d/dx' = d/dx and d/dt' = d/dt + ud/dx. Start with the continuity equation in 
the primed frame and require it to have the same form as (j20j) . that is, with the unprimed 
quantities replaced by primed quantities. Then subtracting the primed equation from the 
unprimed equation and solving for S' gives the transformation rule, 

S'(x', = S(x, t)-~ + ^f(t) Jj + g(t) , (21) 

where /, g are functions of t to be fixed next. 

Now compare the primed version of ()19j) with the unprimed version and use (|21|) to 
conclude 

f(t) = 0, (22) 

9(t) = -4^* + ^ ( 23 ) 

where is a constant. Thus the non-trivial transformation of S under Galilean boosts has 
been determined: notice that the result is independent of B and C! The independence from 
B is due to the fact that B multiplies terms in the equations of motion which only depend on 
p and its spatial derivatives and those structures are invariant by themselves under boosts. 
The independence from C can be understood as follows: The C = equations are invariant 
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under (|21l [2"2l l2*^j) which depend explicitly on x only linearly. Thus the C term in the 
equations of motion, which involves two spatial derivatives must be invariant under the 
same transformation. 

Therefore one deduces that the transformation of the variables p, S under Galilean trans- 
formations is the same in classical ensemble dynamics (B = 0, C = 0), in linear quantum 
theory (B ^ 0, C = 0), and in the nonlinear theory with C ^ 0. The transformations do 
depend on the constant A but since that is just a normalisation factor it can be removed by 
redefining the metric (mass). 

Recalling the Madelung change of variables, it is not surprsingly that the transformation 
of S found above is precisely the transformation of the phase of the Schrodinger wave 
function under Galilean boosts discussed, for example, in [l5|. It should also be noted that 
the explicit linear dependence of S' on the product ux is consistent with the interpretation 
of dS/m as the velocity of the particle mentioned in Sect.(l): it transforms correctly under 
boosts. 

For the multidimensional case (|17|18|) the transformations of the coordinates and deriva- 
tives under Galilean boosts are 

t' = t, (24) 
x\ = Xi- U{t, (25) 
d[ = d h (26) 
d t , = d t + Ujdj . (27) 

The metric gij and probability density p remain invariant. Then, as above, one deduces the 
transformation of S (setting for convenience A = 1/2), 

S'(x', = S{x, t) - g ijUiXj + Wp-t + (28) 

where g is the inverse metric with diagonal coefficients gu = 1/gu and zero otherwise. The 
constant parameter <p actually represents the global gauge invariance of the equations and 
corresponds to the conservation of probability. 
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VI. APPENDIX B: GAUGE INEQUIVALENCE 



Setting A = 1/2 and B = h 2 /8, one may re-write the coupled equations ()17I18|) in terms 
of the wavefunction ip = ^Jpe lS l h , 

ittoj) = H s t/j + , (29) 



F^cfM^-^i). (30) 



with H s the usual Schrodinger Hamiltonian and F a nonlinear correction given by 

/ (djp)Jj _ di, 
\ p 2 p 

The current 

J% = 9ijPdjS (31) 
= ^9i 3 {Vd^-^d 3 r) (32) 

is that which appears in the continuity equation 

p + d i J i = 0. (33) 

The equation (|29l) belongs to a class of Galilean invariant nonlinear Schrodinger equations 
obtained in Ref. jl^. In terms of the structures R\, i? 4 that are defined in |l^ |. 

F = -C(i? 1 - R A ) . (34) 

One may ask if the nonlinear piece F may be eliminated through some change of variables 
in the equation (J2HJ), leading to a physically equivalent linear equation. The results of [l^ 
however show that for real nonlinearities the only nonlinear Schrodinger equations that are 
equivalent, through a nonlinear gauge transformation that keeps p(x,t) invariant, to the 
linear Schrodinger equation are those for which the nonlinearity is proportional to the usual 
quantum potential Q. Since the F term above is not proportional to Q, the nonlinear 
Schrodinger equation is not equivalent to a linear Schrodinger equation. 

It must be emphasized that although the nonlinear equation with C / belongs to 
the class considered in the equation derived here has some positivity constraints on 
the coefficients that come from the positivity imposed on the ensemble Hamiltonian, as 



13 



discussed in the text. Such constraints are absent for the nonlinear equations in [16| which 
were constructed using a different approach. 
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